A Hierarchical Bayesian Model for Topic Segmentation
نویسندگان
چکیده
Many streams of real-world data, such as conversations or body movements, consist of relatively coherent segments, each characterized by particular topics or controllers. Making sense of these data requires simultaneously segmenting the sequences and inferring the structure of the segments. We present a hierarchical Bayesian model that can be used to break a sequence of utterances or movements into segments with different distributions over topics or controllers. We apply this model to a database of meetings, showing that its unsupervised segmentation is competitive with other approaches, and a database of human hand movements, revealing some of the controllers for motions of the hand.
منابع مشابه
Topic Segmentation with a Structured Topic Model
We present a new hierarchical Bayesian model for unsupervised topic segmentation. This new model integrates a point-wise boundary sampling algorithm used in Bayesian segmentation into a structured topic model that can capture a simple hierarchical topic structure latent in documents. We develop an MCMC inference algorithm to split/merge segment(s). Experimental results show that our model outpe...
متن کاملTraffic Scene Analysis using Hierarchical Sparse Topical Coding
Analyzing motion patterns in traffic videos can be exploited directly to generate high-level descriptions of the video contents. Such descriptions may further be employed in different traffic applications such as traffic phase detection and abnormal event detection. One of the most recent and successful unsupervised methods for complex traffic scene analysis is based on topic models. In this pa...
متن کاملHierarchical Text Segmentation from Multi-Scale Lexical Cohesion
This paper presents a novel unsupervised method for hierarchical topic segmentation. Lexical cohesion – the workhorse of unsupervised linear segmentation – is treated as a multi-scale phenomenon, and formalized in a Bayesian setting. Each word token is modeled as a draw from a pyramid of latent topic models, where the structure of the pyramid is constrained to induce a hierarchical segmentation...
متن کاملA Model for Tax Evasion Forcasting based on ID3 Algorithm and Bayesian Network
Nowadays, knowledge is a valuable and strategic source as well as an asset for evaluation and forecasting. Presenting these strategies in discovering corporate tax evasion has become an important topic today and various solutions have been proposed. In the past, various approaches to identify tax evasion and the like have been presented, but these methods have not been very accurate and the ove...
متن کاملDeveloping a Macro-segmentation Model at Industry Level: Iranian Banking Industry
Drastic changes and turbulence in macro-economic factors have the greatest impact on banks target market attractiveness in Iran. It is assumed that conventional segmentation models at the corporate level are not efficient for banking system. This study aims to develop a new segmentation model at the industry level for banks of Iran. For this purpose, structures and variables at the industry lev...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005